AITopics | label corruption

05b12f103c9e613efc4c85674cdc9066-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 09:17:52 GMT

artificial intelligence, exp, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Trimmed Maximum Likelihood Estimation for Robust Learning in Generalized Linear Models

Neural Information Processing SystemsApr-24-2026, 09:17:48 GMT

We study the problem of learning generalized linear models under adversarial corruptions. We analyze a classical heuristic called the iterative trimmed maximum likelihood estimator which is known to be effective against label corruptions in practice. Under label corruptions, we prove that this simple estimator achieves minimax near-optimal risk on a wide range of generalized linear models, including Gaussian regression, Poisson regression and Binomial regression. Finally, we extend the estimator to the more challenging setting of label and covariate corruptions and demonstrate its robustness and optimality in that setting as well.

Add feedback

322f62469c5e3c7dc3e58f5a4d1ea399-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 00:34:27 GMT

corruption, deep ensemble, multisw ag, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

05b12f103c9e613efc4c85674cdc9066-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 07:19:05 GMT

exp, generalized linear model, regression, (16 more...)

Neural Information Processing Systems

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

05b12f103c9e613efc4c85674cdc9066-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 07:19:01 GMT

Under label corruptions, we prove that this simple estimator achieves minimax near-optimal riskonawiderange ofgeneralized linear models, including Gaussian regression, Poisson regression and Binomial regression.

artificial intelligence, machine learning, regression, (18 more...)

Neural Information Processing Systems

Country: Asia > Afghanistan > Parwan Province > Charikar (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)

Add feedback

Trimmed Maximum Likelihood Estimation for Robust Generalized Linear Model

Neural Information Processing SystemsDec-23-2025, 17:16:41 GMT

We study the problem of learning generalized linear models under adversarial corruptions.We analyze a classical heuristic called the \textit{iterative trimmed maximum likelihood estimator} which is known to be effective against \textit{label corruptions} in practice. Under label corruptions, we prove that this simple estimator achieves minimax near-optimal risk on a wide range of generalized linear models, including Gaussian regression, Poisson regression and Binomial regression. Finally, we extend the estimator to the much more challenging setting of \textit{label and covariate corruptions} and demonstrate its robustness and optimality in that setting as well.

name change, robust generalized linear model, trimmed maximum likelihood estimation, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)

Add feedback

Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise

Dan Hendrycks, Mantas Mazeika, Duncan Wilson, Kevin Gimpel

Neural Information Processing SystemsNov-20-2025, 23:38:23 GMT

The growing importance of massive datasets used for deep learning makes robustness to label noise a critical property for classifiers to have.

classifier, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > Nevada (0.04)
(2 more...)

Genre: Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

322f62469c5e3c7dc3e58f5a4d1ea399-Supplemental.pdf

Neural Information Processing SystemsOct-2-2025, 14:57:27 GMT

artificial intelligence, deep ensemble, machine learning, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Trimmed Maximum Likelihood Estimation for Robust Generalized Linear Model

Neural Information Processing SystemsOct-9-2024, 10:40:46 GMT

We study the problem of learning generalized linear models under adversarial corruptions.We analyze a classical heuristic called the \textit{iterative trimmed maximum likelihood estimator} which is known to be effective against \textit{label corruptions} in practice. Under label corruptions, we prove that this simple estimator achieves minimax near-optimal risk on a wide range of generalized linear models, including Gaussian regression, Poisson regression and Binomial regression. Finally, we extend the estimator to the much more challenging setting of \textit{label and covariate corruptions} and demonstrate its robustness and optimality in that setting as well.

corruption, robust generalized linear model, trimmed maximum likelihood estimation, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks

He, Tianyu, Doshi, Darshil, Das, Aritra, Gromov, Andrey

arXiv.org Machine LearningJun-4-2024

Large language models can solve tasks that were not present in the training set. This capability is believed to be due to in-context learning and skill composition. In this work, we study the emergence of in-context learning and skill composition in a collection of modular arithmetic tasks. Specifically, we consider a finite collection of linear modular functions $z = a \, x + b \, y \;\mathrm{mod}\; p$ labeled by the vector $(a, b) \in \mathbb{Z}_p^2$. We use some of these tasks for pre-training and the rest for out-of-distribution testing. We empirically show that a GPT-style transformer exhibits a transition from in-distribution to out-of-distribution generalization as the number of pre-training tasks increases. We find that the smallest model capable of out-of-distribution generalization requires two transformer blocks, while for deeper models, the out-of-distribution generalization phase is \emph{transient}, necessitating early stopping. Finally, we perform an interpretability study of the pre-trained models, revealing the highly structured representations in both phases; and discuss the learnt algorithm.

pre-training task, sequence, task vector, (12 more...)

arXiv.org Machine Learning

2406.0255

Country: North America > United States > Maryland > Prince George's County > College Park (0.04)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback